Stretchy Time Pattern Mining: A Deeper Analysis of Environment Sensor Data
نویسندگان
چکیده
Mining sequential patterns on environment sensor data is a challenging task; the data can present noises and may also contain sparse patterns, which are difficult to be detected. The knowledge extracted from environment sensor data can be used to determine climate changes. However, there is a lack of methods that can handle this kind of database. In this paper, we propose a method to mine sequential patterns in sparse, incomplete and noisy sensor data. The proposed method, called Stretchy Time Windows (STW), allows the mining of sequential patterns that present time gaps between their events. We propose an algorithm to implement STW, called Miner of Stretchy Time Sequences (MSTS). The proposed algorithm works with sequences of any size and uses a balanced strategy to analyze the search space. Our experiments show that MSTS returns sequences that have a longer period of analysis than GSP a traditional frequent pattern mining algorithm. In fact, 5 times larger than GSP and higher number of patterns (2.3 times) when compared to previous methods.
منابع مشابه
Incremental Mining of Frequent Sequences in Environmental Sensor Data
The mining of sequential patterns in environment sensor data is a challenging task. Most of sequential mining techniques requires periodically complete data. Furthermore, this kind of data can be incomplete, present noises and be sparse in time. Consequently, there is a lack of methods that can mine sequential patterns in sensor data. In this paper, we proposed IncMSTS, an incremental algorithm...
متن کاملOutlier Detection in Wireless Sensor Networks Using Distributed Principal Component Analysis
Detecting anomalies is an important challenge for intrusion detection and fault diagnosis in wireless sensor networks (WSNs). To address the problem of outlier detection in wireless sensor networks, in this paper we present a PCA-based centralized approach and a DPCA-based distributed energy-efficient approach for detecting outliers in sensed data in a WSN. The outliers in sensed data can be ca...
متن کاملAnalysis and Forecast of Mining Accidents in Pakistan
In the mining sector, the barrier to obtain an efficient safety management system is the unavailability of future information regarding the accidents. This paper aims to use the auto-regressive integrated moving average (ARIMA) model, for the first time, to evaluate the underlying causes that affect the safety management system corresponding to the number of accidents and fatalities in the surf...
متن کاملA comparison between knowledge-driven fuzzy and data-driven artificial neural network approaches for prospecting porphyry Cu mineralization; a case study of Shahr-e-Babak area, Kerman Province, SE Iran
The study area, located in the southern section of the Central Iranian volcano–sedimentary complex, contains a large number of mineral deposits and occurrences which is currently facing a shortage of resources. Therefore, the prospecting potential areas in the deeper and peripheral spaces has become a high priority in this region. Different direct and indirect methods try to predict promising a...
متن کاملRock mass structural data analysis using image processing techniques (Case study: Choghart iron ore mine northern slopes)
Presence of joints and fractures in rocks strongly influences the behavior of the rock mass by dividing the media into smaller units. These structures intensify the potential instability besides the development of sliding and rotational movements. The assumption of discontinuum media changes the whole analysis conditions in relation to the continuum analysis. Acquisition of geometrical and stru...
متن کامل